Picture for Hang Yan

Hang Yan

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Add code
Jun 01, 2026
Viaarxiv icon

ChartAct: A Benchmark for Dynamic Chart Understanding

Add code
May 28, 2026
Viaarxiv icon

Dual-Dimensional Consistency: Balancing Budget and Quality in Adaptive Inference-Time Scaling

Add code
May 14, 2026
Viaarxiv icon

Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses

Add code
Apr 28, 2026
Viaarxiv icon

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Add code
Apr 16, 2026
Viaarxiv icon

MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning

Add code
Apr 15, 2026
Viaarxiv icon

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Add code
Feb 05, 2026
Viaarxiv icon

Steering LLMs via Scalable Interactive Oversight

Add code
Feb 04, 2026
Viaarxiv icon

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Add code
Feb 03, 2026
Viaarxiv icon

Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories

Add code
Sep 16, 2025
Figure 1 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Figure 2 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Figure 3 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Figure 4 for Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Viaarxiv icon